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An L 1 ergodic theorem with values in a nonpositively 
curved space via a canonical barycenter map 



Andres Navas 



o 

v . 

Q ■ The extension of classical ergodic theorems to a geometric -nonpositively curved- setting 

has been one of the most fascinating developments of Ergodic Theory over the last years; see 
[6] for a nice survey containing most of the relevant results for functions (cocycles) taking 
values in isometry groups. 

In a different though related direction, A. Es-Sahib and H. Heinich proved in [3] an ergodic 
type theorem for L 1 i.i.d. random variables taking values in a nonpositively curved space. An 
analogous result for L 2 i.i.d. random variables was given by K.-T. Sturm in [TJ. Recently, 
T. Austin proved a nice extension of Sturm's result to arbitrary measure-preserving actions of 
amenable groups (see [I]). Unfortunately, Austin's L 2 -setting is not the most appropriate one 
in view of that the most powerful framework of the ergodic theorem is that of L 1 spaces. In 
this work, we prove a general ergodic theorem for L 1 functions taking values in nonpositively 



curved spaces, where the notion of Birkhoff sums is replaced by that of barycenters along the 
orbits. 

l/-) , Let us begin by recalling a classical construction. Given a complete CAT(0)-space (X,d), 

' we consider the space P 2 (X) of probability measures with finite second moment, that is, 
O " 

d(x, y) 2 dfj,(y) < oo 

x 

(this condition does not depend on the point x G X). Following Cartan (see for instance 
[5]), to each /iGP 2 (X) one may associate a barycenter bar(^i), namely the unique point that 
minimizes the function 

x — > / d{x, y) 2 dfi(y). 
Jx 

A crucial property of bar: P 2 (X) — > X is that it is 1-Lipschitz for the 2-Wasserstein metric [7J: 



d(bar(ni),bar(/j, 2 )) < W 2 (ni, H2) '■= inf \ d(x,y) 2 dv(x,y), 



I XxX 



where (yu.1l//2) denotes the set of all probability measures v on X x X that project into \i\ and 
fi2 on the first and the second factor, respectively (see [8] for more details on this metric). 

The first task of this work was to introduce an analogous notion for the space P 1 (X) of 
probability measures with finite first moment: 

d(x, y) dfi(y) < 00. 

x 
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It was after we developed a notion of barycenter adapted to our needs that we discovered the 
equivalent construction of jl]. We decided to include our approach here because it is more 
elementary in that, unlike [4], it does not rely on deep probabilistic results. Although this 
makes our computations a little bit more involved, it has the advantage of allowing us to 
avoid the (finite) local compactness hypothesis of jl] for the underling space, thus solving a 
problem formulated in [7J Example 6.5]. Summarizing, let (X, d) be a complete metric space 
with nonpositive curvature in the sense of Buseman (a Buseman space, for short). Assuming 
that X is separable, in ^1] we construct a map bar* : P 1 (X) — > X that is 1-Lipschitz for the 
1-Wasserstein metric: 



d(bar*(fii),bar*(fi 2 )) < W\(fii, fi2) inf / d(x,y) du(x,y). 

By elementary reasons, this also applies to any separable Banach space, where geodesic are 
understood as being segments of lines. 

The map constructed above is equivariant with respect to the natural action of isometries. 
At the end of §H we give an application of this fact, namely, we prove that every compact 
group of isometries of a Buseman space has a fixed point. The novelty here is that we do not 
assume any hypothesis of strict convexity (having such an hypothesis, the result is elementary 
and well-known). 

We next enter into the goal of this work. Given an amenable group G with a measure- 
preserving action T on a probability space (Q,V), let (F n ) be a tempered F0lner sequence in 
G, that is, a F0lner sequence for which there exists C > such that for all nGl, 



( |J F fe - x F n ) < Cm G (F n ), 



m G 

k<n 



where mg denotes the left Haar measure on X. Let ip : Q — » X be a measurable function 
lying in ^(V, X), that is, such that for some (equivalently, all) x E X, 



d(ip(cu),x) dV(co) < oo. 
Notice that ^(V, X) becomes a metric space when endowed with the distance 



di((p,i/)) := J j d(ip{u),ip{uj))dV(uj). 

Main Theorem. With the notation above, assume that X is either a separable Banach space 
or a separable Buseman space. Then 



u i y bar* ( ^— / <5 v p* w ) dm G {g) J 

\m G {F n ) J Fn VK J 



is a sequence of maps that converges pointwise and in L l (V,X) to a T -invariant function 
from Q to X . 
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For Banach spaces, the barycenter of a measure —(6 Xl + ■ ■ • + 5 Xrn ) is just the Dirac 
measure concentrated at the point — (x\ + • • ■ + x m ). In particular, when G ~ Z, X = K and 
_F n = {0, . . . , n — 1}, the theorem reduces to the classical (invertible) Birkhoff ergodic theorem 
for ip G L l {V : M). 

The proof of the Main Theorem uses the general strategy of pQ, namely the contractivity 
properties of the barycenter maps transforms the desired convergence into that of suitable 
sequences of real-valued functions to which Lindenstrauss' pointwise ergodic theorem [3] ap- 
plies. Recall that in the setting of PQ, the probability measure lies in P 2 (X) and one considers 
functions cp : ft — > X lying in the space L 2 (V,X), that is, such that for some (equivalently, 
all) x G X, 

d((p(u),x) 2 dV(u) < oo. 
This space may be naturally endowed with the distance 

d 2 {ip,^):= / d((p(u),tp(u)) 2 dV{u). 
Jn 

Austin's theorem then asserts that for every ip G L 2 (V,X), the sequence of maps 

u^bart tttt / S ( T3uj) dm G (g)) (1) 

\m G {F n ) J Fn VK J 

converges pointwise and in L 2 (V,X) to a T-invariant function from Q to X. 

Quite interestingly, Austin's theorem is not a consequence of our Main Theorem. Indeed, 
although -as in the classical case- our theorem extends to an L p - version by a straightforward 
and well-known argument, the barycenters bar and bar* may differ, even for very nice spaces; 
see Remark 11.31 Despite of this, the map bar is also 1-Lipschitz for the 1-Wasserstein metric; 
see [7J Proposition 4.3]. Using the methods of £]2j this allows showing that the convergence 
of the sequence of maps JT]) actually holds in X). We point out that this still holds 

for probability measures in P 1 (X) for a clever modification of Cartan's barycenter (see [7J 
Proposition 4.3]). 

Acknowledgments. It is a pleasure to thank A. Karlsson for useful hints and references concerning 
fixed points for actions on Buseman spaces, J. Bochi for inspiring discussions on the barycenter map, 
and K.-T. Sturm for a clever remark. 

This work was funded by a Fondecyt Research Project and the Math-AMSUD Research Project 
DySET. 

1 The barycenter map 

For a Banach space X, a natural definition of barycenter of a measure // G P l (X) is 

bar*(fj,) := / xdji{x). 
Jx 

Notice that given in P 1 (A), for each v G ( / u 1 |/i 2 ) we have 
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\x - y\\dv(x,y) > 



XxX 



(x - y)dv{x,y) 



XxX 



xdv(x, y) — 



XxX 



x 



ydv(x,y) 



XxX 



xd{-K x v){x) - / yd(x//)(y) 
J} 

bar*(ni) — bar*(fi 2 



x 



As a consequence, 

||6ar*(/xi) -6ar*(/i 2 ) || < /x 2 )- 

A definition with an analogous property for nonpositively curved spaces is much more 
subtle. In what follows, X will denote a Buseman space (separability will be needed later). 
Recall that this means that X is geodesic and the distance function along geodesies is con- 
vex. Equivalently, given any two pairs of points x, y and x',y', their corresponding (unique) 
midpoints m, m! satisfy 

(2) 



d{m, m ) < h 



2 2 

This property allows defining a barycenter bar n (xi, . . . , x n ) of any finite family (xi, . . . , x n ) 
of (nonnecessarily distinct) points as follows. For n = 1, we let bar±(x) := x. For n = 2, 
we let bar2(x,y) be the midpoint between x and y. Now, assuming that the barycenters 
bar n (-, ...,•) of all families of n points have been defined, we define bar n+ i(xi, . . . , x n , x n+ i) 



[x 



(0) 



„(0) 



1 1 



i X n+l)i 



as follows: Starting with (xi,..., x n+ i) 

defined) barycenter of (xi, . . . , x i+1 , . . . , x n+1 ). Then we do the same with the resulting 
set {xi , . . . , thus yielding a new set (xf^ 

passing to the limit along the Cauchy sequences (x\ k ^ 
to a single point, that we call the barycenter of (xi, . . . , x n+ \). The proof of this convergence 
will be accomplished inductively together with the following crucial relation: 



we replace each Xi by the (already 

}. Repeating this procedure and 
k e n, the corresponding set with collapse 



, . . . , x n+ i 



1 n 

d{bar n (xi, ...,x n ), bar n (x u x n )) < - ^ d(x h y t ) 

i=l 



(3) 



First, for n = 2, the barycenter is already defined, and ([3]) reduces to ([2]). Now, assuming 
that we have showed the existence of the barycenter as well as inequality ()3]) for families of n 
points, let us consider a family (xi, . . . , x n+ i). For each % ^ j in {1, . . . , n + 1}, we have 



d(x\ \ x)') = d(bar n (x 1 ,...,x i _ 1 ,x i+1 ,...,x n+1 ),bar n (x 1 ,...,x j _ 1 ,x j+1 ,...,x n+1 )) < 
Therefore, 



G?(Xi, Xj) 



n 



diam{xi , . . . , x^L} < — diam{x\ : . . . , x n+ i}, 



n 



and more generally, for all k > 1, 



diam{xi , . . . , x^^} < —r diam{x\, . . . , x n+ i}. 



rr 
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By this inequality and Lemma [TTTl below. the diameter of the convex closure of {x[ , . . . , a^+i} 
converges to zero as k goes to infinite. Since x\ belongs to this convex closure for all I > k, 
this shows that bar n+ i(xi, . . . , x n+ i) is well defined. 

Next, take two families (xi, . . . ,x n+ i) and (yi, . . . ,y n+ i). By the inductive hypothesis, for 
each index i G {1, . . . , n + 1}, 

d(xf]yf ] ) = d(bar n (x 1 , ...,x i - 1 ,x i+1 , ...,x n+1 ),bar n (y u y^, y i+1 , ...,y n+1 )) < -^2 d ( x jiVj)- 
Summing over alH = 1, . . . , n + 1, this yields 

n+1 n+1 

E^?U w )<E d (^)- 

i=l i=l 

More generally, for all k > 1, 

n+1 n+1 n+1 

i=l i=l i=l 

Letting /c go to infinite, all the points (resp. converge to bar n+ i(xi, . . . , x n+ i) (resp. 
bar n+ i(yi . . . , y n +i)- Hence, passing to the limit in the previous inequality, we obtain 

n+1 

(n + l)d{bar n+ i(xi, . . . , x n+1 ) , bar n+1 {y x . . .,y n +i)) < /^(x^yj), 

i=i 

as we wanted to show. 



Lemma 1.1. The diameter of the convex closure of every bounded subset of X equals its own 
diameter. 

Proof. An explicit inductive description of the convex closure of a bounded subset B of X 
{i.e. the smallest convex subset of X containing B) proceeds as follows. Letting B := B and 
having defined B X) . . . ,B n , we let B n+1 be the union of all geodesies with endpoints in B n . 
Then B n C B n+ i, and the closure of the union B^ := \J n B n is the convex closure of B. Since 
-Boo contains B, we have diam{B 00 ) > diam(B). To show the converse inequality, it suffices 
to show that for all n > 0, 

diam(B n+ i) < diam(B n ). (4) 

To check this, given arbitrary points x, y in B n+ i, we may find xo,xi and yo,yi in B n such 
that x (resp. y) lies in the geodesic joining x and Xi (resp. y and yi). The convexity of the 
distance along geodesies shows that 

d(x,y ) <max{d(xo,yo),d(x 1 ,yo)} <diam(B n ), 

d(x,yi) < max \d(xo, yi), d(x±, yi)} < diam(B n ). 
Another application of this convexity then shows that 

d(x,y) < max {d(x, yo), d(x, yi)\ < diam(B n ). 

Since x, y were arbitrary points of B n+ i, this shows (j4]). □ 
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By the symmetry of the construction, for every permutation a of {1, . . . , n}, 



bar n (xi, ...,x n ) = bar n (x a{1 ), x a{n) ). 
Having this in mind, fl3]) implies that 

1 n 

d(bar n (xi, ...,x n ), bar n (y u y n )) < - min V] d(x u y a u)). 

i=l 

The important observation here is that (by a theorem of Garrett Birkhoff; see [El Introduction]) 
the right-side expression above corresponds to the 1-Wasserstein distance between certain 
probability measures. More precisely, 

1 - 

- min d(xi, y^) = n 2 ), 
i=i 

where \i\ := \{&x x + • • * + $e„) an d \i 2 ■= h($yi + ' ' ' + ^y„)- I n order to obtain a barycenter map 
that is 1-Lipschitz for the 1-Wasserstein metric, this would motivate to define the barycenter 
of -(S Xl + • • • + 5 Xn ) as bar n (x\, . . . ,x n ). However, such a definition is not intrinsic. For 
instance, though the n-set (xi, x 2 , . . . , x n ) and the 2n-set (xi, Xi, x 2 , x 2 , ■ ■ ■ , x n , x n ) should be 
identified as measures, the points bar n (xi,x 2 , ■ ■ ■ , x n ) and bar 2n (xi,Xi, x 2 , x 2 , . . . , x n , x n ) do 
not necessarily coincide. As a matter of example, the reader may easily check that for X 
being a tripod of endpoints x,y,z and edges of the same length £, the points bar±{x,x,y,x) 
and bg(x, x, x, x, y, y, z, z) are different. (The former is at distance ^ from x, while the second 
is at distance form the same vertex; see Figure 1.) 

x 



bar±(x, x, y, z) 



bar&(x,x,x,x,y,y,z,z) 




y z 
Figure 1 

To solve the problem above, we will slightly modify the definition of the barycenter of finite 
families of points so that it becomes invariant under the procedure -at the level of measures- 
of "subdivision of mass along the atoms". Given an arbitrary family Q = (x\, . . . ,x n ) of 
points in X, we let 

Q • • • • > •Ejij 2*1 > • • • > ^ni • • • j -Eli ■ ■ ■ j ^n); 

where the number of blocks is k. 

Proposition 1.2. The sequence of barycenter s bar n k(Q h ) is a Cauchy sequence. 
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Assuming that this proposition holds, and since X is supposed to be complete, we may 
define the (canonical) barycenter 

as the limit point of the sequence bar nk (Q k ). Indeed, one readily checks that this limit point 
depends only on the corresponding measure and not on any particular way of writing it as 
a equally weighted mean of Dirac measures (with nonnecessarily different atoms). Moreover, 
we still have the crucial relation 

(1 1 \ 1 n 

bar*(-(5 Xl ^ h 5 Xn ) J , bar*( -(S X1 H \- S Xn )j ) < - min ^ d{x h y^). 
' i=l 

Thus, denoting by Pq(X) the set of atomic probability measures on X all of whose atoms have 
rational mass, we have a well-defined map bar* : Pq(X) — > X, and the previous inequality 
translates into that this map is 1-Lipschitz for the 1-Wasserstein metric: for all 1^1,1^2 i n 

d(6ar*(/xi),6ar*(/x 2 )) < W^i, aO- (5) 

If X is separable, then it is known that Pq(X) is IVi-dense in P 1 (X). We may hence extend 
the map bar* to all P 1 (X) so that ([5]) holds for all fii, 1^2 in -P X (A). This concludes our 
construction. 

Remark 1.3. It is worth pointing out that for CAT(0)-spaces, bar* does not necessarily 
coincide with the Cartan barycenter. Indeed, for the example illustrated by Figure 1, the 
Cartan barycenter of the measure 4f + x + if ls the origin, though the barycenter bar* of 
this measure lies on the axis joining the origin to x (see the final remark of [4^ Section 1.2]). 

To close this section, we next give a proof of Proposition 11.21 Let us mention that this 
proposition is also proved in [1] by means of a quite indirect argument that uses a deep 
martingale theorem and requires X to satisfy a weak local-compactness property. Although 
this very elegant approach does not seem to be the most appropriate one in view of the purely 
geometric nature of the statement, the reader will still recognize a certain probabilistic flavor 
in our computations below. The key estimate for the distance between the barycenters of Q k 
and Q k+l is provided by the next 

Lemma 1.4. For every 1/2 < a < 2/3, there exists a constant C = C(a) > and L ^> 1 
such that for all positive integers I, k satisfying L < I < \/k, one has 

d{bar nk (Q k ),bar n{k+l) (Q k+l )) < CD—, (6) 

where D denotes the diameter of the set {xi, . . . ,x n }. Moreover, for < I < L, one still has 
the weaker estimate 

d(bar nk (Q k ), bar n{k+l) (Q k+l )) <D~. (7) 
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Assuming that this lemma holds, let us prove Proposition 11.21 Given e > 0, fix an integer 
k £ > max{L, 10} such that 

D 3 3 ~ 3a CD 

h + (2-3a)(A; £ -l) 2 -3« < £ > 

where C is the constant provided by Lemma 11.41 For any k± < k 2 larger than k e , define 
the sequence (£j) by £% := k\ and £j + \ := £j + \\ftj\- One readily checks by induction that 
£j > {k £ + j) 2 /9 holds for all j > 1. Choose m > 1 such that £ m < k 2 < £ m +i- By Lemma Tl. 41 

d(bar nti {Q t '),bar n t i+1 (Q ti+1 )) < CD ~ ^ < J = l,2,...,m- 1. 



2 

Moreover, 



1/2- 



Using the triangle inequality, this yields 

d(bar nkl (Q kl ),bar nk2 (Q k2 )) < -tj^ + J^^M 



< 



1/2 

«rk j= i £. 2 

D ^ &-3<*CD 



3-3a 



D r°° dr 

< ^ + 3 3 ~ 3a CD ' 



U I ™3— 3a 

L> 3 3 - 3a CD 
~ h + (2-3a)(k £ -iy- 3 * < £ ' 

thus showing the Cauchy property. 

It remains to prove Lemma 11.41 The starting remark is given by the next 

Lemma 1.5. Given integers 1 < I < m and points x, y\, . . . , y m in X, the distance between x 
and bar m (yi, . . . , y m ) is smaller than or equal to the mean distance between x and the points 
of the form bar m -i(yx, y^, , y m ), where i\, . . . ,ii range over all possible choices 

of different values in {1, . . . , m} (and each weight equals m{m — 1) • • • (m — I + 1) = j^zjy )- 

Proof. For 1 = 1, this follows as an application of (EJ) to 

bar m (y h . . . ,y m ) = bar m {bar m _i(yi, ...,y h .. .,y m ), i = 1,. . . ,m). 

The general case easily follows by an inductive argument using again (j3J). □ 

The idea of the proof of Lemma 11.41 consists in viewing the process of "reduction of coor- 
dinates" for passing from Q k+l to Q k as a random process, which should imitate a Bernoulli 
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trial for large values of k ^> I (this process has an hypergeometric multivariate distribution). 
For each index j, the final associated error (i.e. the difference between I and the number of 
deleted entries Xj) should be -in mean- much smaller than In. This allows passing from the 
elementary though useless upper bound ~.DZ/fc for the distance between the barycenters to 
the much better upper bound ~CT>Z 3a ~ 1 jk. 

Proof of Lemma 1.4 As explained above, estimate ([7]) follows as a direct application of 
Lemma ll.5[ so let us concentrate on ([6]). Lemma 11.51 again implies that the distance from 
bar n k(Q k ) to bar n (k+i)(Q k+l ) is smaller than or equal to the mean of the distance between 
bar n k(Q k ) and the points bar n k(yi, . . . ,Vkn), where (yi, . . . ,ykn) ranges over all families that 
coincide with Q k+l except for the deletion of In entries. Among these families, the number of 
those for which the deleted entries correspond to a Xj-position a number of times equal to ij 
(with %\ + h i n — nl) is 

f k + A (k + l\ fk + V 



Moreover, the distance from the barycenter of such a family to bar(Q k ) is smaller than or 
equal to 

jr(\ii-i\ + \i2-i\ + --- + K -i\). 

By Lemma [1.51 this implies that d(bar n k(Q k ),bar n ^+i)(Q k+l )) is smaller than or equal to 

ilH \-i„=nl \ nl I 

n nl (k+l\ /(n-l){k+l)\ „ nl-l (k+l\ /(n-l){k+l)\ „ I (k+l\ /(n-l)(k+l)\ 

U ST^ \ j )\ nl-i ) I ■ /I _ U \l+i)\ nl-l-i )_ ■ , nl-l+i ) 

~ 2-^i (n(k+l)\ I k ^ (n(k+t)\ ~^ 2-^ (n(k+l)\ 

i=0 \ nl ) i=0 \ nl ) i=0 \ nl ) 



I. 



We will estimate the first of the two sums above, leaving to the reader the task of carrying 
out analogous computations for the second sum. First, notice that 

n nl-l (k+l\ ((n-l)(k+l)\ „ l a (k+l\ ((n-l)(k+l)\ „ nl-l (k+l\ ((n-l)(k+l)\ 

L> \^ Vl+i) \ nl-l-i I ■ ^ \^ \l+ i) \ nl-l-i ) ■ . V \^ U+iA nl-l-i ) ■ 

k (n(k+l)\ 1 2^ (n(k+l)\ % + (n(k+l)\ 1 

i=0 \ nl J i=0 \ nl ) i=l a \ nl ) 



< 



< 



2-^ fn(k+l)\ ' U 2-^ (n(k+l)\ 

i=0 \ nl J ' i=l a \ nl ) 

Dl* , Din -1)1 , ^(K)( ( -a-t°) 



k k V ^ (n(k+l)\ 

i=0 \ nl I 

The proof will then follow from an estimate of the form 



i=Q 



(k+l\ ({n-l)(k+l)\ n 
\l+i) \ nl-l-i J < ^ 

^n(fc+Z) j — /2-3o ' 
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To show this, first rewrite 



(k+l 



)( (n ^-T°) ( fe r')( ( 7n-i)f) Kk - !).-.(* - i + 1) ((n - l)0((n - 1)/ - l)-((n - 1)Z - i + 1) 



(»(*+*)) (»(*+9) (/ + i)(/ + 2)...(/ + i) ((n-l)fc)((n-l)fc + l) — ((n-l)Jfe + i) 

Now, using the improved version of Stirling's inequality (see (2J Chapter II. 9]) 

/m\ m i < fm\ m _j_ 

V27rm^ — J e 12m +! < m! < V27rm^ — J e 12m , 

9 g 

one easily checks that for a certain e 12 (' +1 > < A < ewi, 



/k+i\ /(t»-i)(fc+i)\ / 
U+iJ I (n-l)J-i J . / (k + l)n 



("(J+0) A y 2^/(72 - 1) " A V 27r/(n - 1) ' (9) 

On the other hand, choosing L ^> 1 and c > such that | log(l + x) — a;| < ex 2 holds for all 
\x\ < 1/L 2 ~ 2a , for all I > L we have 



log 



k{k - 1) • • • (A; - % + 1) ((n - l)0((n - 1)/ - 1) • ■ • ((n - 1)1 - i + 1) 
(/ + !)(/ + 2) •••(/ + i) ((n - l)fc)((n - l)fc + 1) • • • ((n - Tjk + i) 



log 



i-1 



(i+f)(i+f)---(i+f)(i+(^)---(i+(^: 



j — 1 j t— 1 I .Q . Q 

(n - 1)1 ~ 2s T ~ 2s J ~ 2s (n - l)k ~ C \p)~ C \¥ 

m=l v ' m=l m=l m=l v ' 

> i2fl _ ^~ 2 ) _ 4d 3*-2 

~ 2l(n-l) 2l(n-l) 

Putting this together with and using the inequality 1 — x < e~ x , one easily concludes that 
the expression 



P fk+l\ /(n-l)(k+l)\ 
V nl—l-i ) 



/;<(/.-+/! 
i=0 ^ nl 

is larger than or equal to 



- l<x 

C \ \ -v / n » 2 n 



72-3a / 



2(n-l)i 



The involved series can obviously be compared with an integral: 

.9 

I " - / " v^ e 2<n ~ 1)! / " / - "-1 



1 /' r V / T^T)I .2 f 00 - :_ / ~ — ~ 

> / e -—dx=l- e~~dx>l-2e 2 V («-i)f. 
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Putting all of this together one easily obtains OH]), which concludes the proof. 



□ 



An application: a fixed point theorem. By construction, the map bar* is equivariant un- 
der the action of isometries. As a consequence, every action of a compact group by isometries 
of a Buseman space has a fixed point. Indeed, the push-forward of the Haar measure along 
an orbit is an invariant probability measure for the action. By equivariance, the barycenter 
bar* of this measure must remain fixed. 

Despite the simple argument above, it is worth pointing out that a much stronger result 
holds: if a group action by isometries of a Buseman space has a (nonempty) compact invariant 
set, then it has a fixed point. (In particular, actions on a proper such space with bounded 
orbits must have fixed points.) Although the author was convinced that this was pretty well- 
known, according to the specialists it is apparently new, so we sketch the argument of proof 
below (the details are left to the reader). 

We will use the following construction. Given a compact subset B of X, we let B* be the 
set of all midpoints between points of B whose distance realizes the diameter. By Lemma 11.11 

diam(B*) < diam(B) =: D. 

Moreover, if equality holds, then there are points x\, x 2 , £3,^4 in B such that the distance 
between any of them equals D. Indeed, let y, z in B* be such that d(y,z) = D. Let Xi,x 2 
(resp. 23, £4) be points in B such that y (resp. z) is the midpoint between x\ and x 2 (resp. 
x 3 and £ 4 ) and d(xi,x 2 ) = rf(x 3 ,x 4 ) = D. Using 

n ,/ v . d(x u x z ) d{x 2 ,x 4 ) 

D = d(y, z) < + < D, 

we conclude that d(xi,x 3 ) = d{x 2) Xi) = D. Similarly, using 

D = d{y,z)< d{x f i K d{x f* ) < D i 

we conclude that d{xi,Xi) = d(x 2 ,xs) = D. 

The preceding argument easily allows to show the following generalization: starting with 
B\ := B of diameter D, define inductively B n := (5 n _i)*. If diam(BN) = D, then there exist 
2^ points x\, . . . , x 2 n in B such that the distance between any of them equals D. 

Assume now that T acts on X preserving a compact set B. Compactness type arguments 
easily yield a compact invariant subset B of B of minimal diameter D. We claim that B is 
a single point (hence a fixed point for the action). Indeed, assume otherwise and cover B 
by finitely many (say, M) open balls of radius D/2. Since all the B n 's are also compact and 
invariant, the minimality of D yields diam(B n ) = D for all n > 1. Fix iV such that 2 N > M. 
By the discussion above, there exists a sequence of points x\ , . . . , x 2 n in B such that the 
distance between any of them equals D > 0. However, this is impossible by the choice of N. 

2 The L 1 ergodic theorem 

To simplify, given (p : Q — > X, let us denote 

m G {F n ) J Fn 
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the n th empirical measure associated to (p. Notice that for all (p, ip in X) and all n > 1, 



d(har* [^ n ^(uf)) ,bar* (ii n ^(uj)) J c£P(u;) 



d(bar*( 7— . 

1 



m G (F n ) J Fi 



^(T9u,)dm G (g)\ dV(ui 



< I tttt / d{rtT!>w),-il>(T 9 u))dm G (g)dP(u>) = / <</>(u;), ^(w)) dP(w), 



hence 



d(bar*(p niV (u)),bar*(fi n)V (u)))dV(oj) < di(<p,i}>). 



(10) 



To prove the Main Theorem, let us first assume that <p takes values in a finite set, say 
{xi, . . . , Xfc}, and let fli be the preimage of {xi} under (p. A direct application of Lindenstrauss' 
ergodic theorem [3] to the characteristic function of Q{ yields the existence almost everywhere 
of the following limit: 

:= lim ^(fa 6 ^^}) 

n^oo m G (F n ) 
We claim that almost surely we have the convergence 

k 



bar*(/i n!ip ) — > bar*(^2\i(u)8 x /j. 



Indeed, since bar* is 1-Lipschitz for W%, given e > we have that for almost every u £ Q 
there exists n(w,e) > 1 such that for all n > n(u,e) the following holds: 

d^M^&wsj) < Wl ( E mG{{9 £ ^ }) s Xi ,± x^) 



i=l 



i=l 



i=l 

< e. 



A; (a;) 



diam{xi, . . . , x^} 



This shows the convergence (flTj) . Now notice that by construction, both bar*(fi n ^) and 
6ar*( 52i=i A«( w )^xJ belong to the convex closure of {x\, . . . , x^}. By Lemma [TTT| this im- 
plies that for all n > 1, the distance between these two points is less than or equal to 
diam{x\, . . . ,Xf.}. A direct application of the dominated convergence theorem then shows 
that the convergence (fTTl) also holds in L l (V,X). 

In order to deal with the general case we will need the next 

Lemma 2.1. There exists a constant C > (depending only on the sequence (F n )) such that 
for all ip, ip in L 1 (X, //) and all X > 0, 



V 



u eQ: supd(bar*(p n #(u)),bar*(fi n rf(u))) > A 



n>l 



C 

< —di(ip,ip). 



(12) 
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Proof. Since bar* is 1-Lipschitz for Wi, the set involved in the inequality above is contained 
in {a; G Cl : sup n>1 W\ \H n ,ipi AWv — Now, noticing that the measure 



m G {F, 



n) JF n 



fi(<p(TBu3),i>(TBu)) dm G (g) 



lies in (fJ, n ,ip\n>n,i>), we obtain 



d(<p(T*u>),1>(T°u))dm G (g). 



m G (F n) JFn 

Thus, the left-side expression of ( |T2|) is smaller than or equal to 

1 



V 



weO: sup 



n>i m G (F n ) Jf„ 



d(y(T°u),if)(T°uj))dm G (g)>\ 



Now, a direct application of Lindenstrauss' maximal ergodic theorem (see [3J Theorem 3.2]) 
yields the existence of a constant C > (depending only on (F n )) such that this last probability 
is smaller than or equal to 

- / d(p(u),iP(u))dV(u), 
A Jn 



as desired. 



□ 



We may now proceed to complete the proof of the Main Theorem. Since X is assumed to be 
separable, for each ipEL 1 ^ \X) there exists a sequence of finite- valued functions (pp.: Cl ^ X 
that converges to ip in the L 1 sense. Thus, given e > 0, we may fix if) := such that 
d^ip^Ke 2 . ByflTJ]), 



V 



n>l 



E Cl: sup d( bar* (fx n!ip (uj)), bar* (n n ^(u)) ) > £ 



C 

< -di(<p,if)) < Ce. 



Since bar*([A n ^) is known to converge almost everywhere, this inequality implies that on a set 
of measure at least 1 — Ce, the sequence (bar* (jj, ntlfi (co))) asymptotically oscillates by at most 
2e. Since this is true for all e > 0, this shows that bar* (n n ^(uj)) converges almost surely. 
Finally, to show the convergence in L l (Cl,X), just notice that by (flOl) . 



J d(bar*((i ntV (u)),bar*((i m ,<p(u))^ dV{u)) 

< J d(bar*(fj, n ^),bar*(^ n ^ k fj+d(bar*(fj, niV J,bar*(^ dV(u) 

< 2d\((p, (fk) + J d(bar*(fj, njlfik ),bar*(fj,m jlfik 

For a given e > 0, we may fix k large enough so that di(p,p k ) < £/3. Since (bar* (ii njipk )j 
converges in L 1 (V ) X) as n goes to infinite, we may fix n e so that for all n, m larger than n £ , 



J d(bar*((i n , Vk ),bar*((i m> <p k )) dV(u) < 
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Putting all of this together we obtain that for all n,m larger than n E , 

d(bar*(ii n , !ip (uj)),bar*(ii m ^(uj)) S j dV(u) < e. 
Hence, (bar* ((p nj(p )) is a Cauchy sequence in X), as we wanted to show. 
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